AITopics

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Search (1.00)
Information Technology > Artificial Intelligence > Machine Learning (1.00)
Information Technology > Software > Programming Languages (0.78)

arXiv.org Artificial IntelligenceDec-2-2025

DeepCAVE: A Visualization and Analysis Tool for Automated Machine Learning

Segel, Sarah, Graf, Helena, Bergman, Edward, Thieme, Kristina, Wever, Marcel, Tornede, Alexander, Hutter, Frank, Lindauer, Marius

Hyperparameter optimization (HPO), as a central paradigm of AutoML, is crucial for leveraging the full potential of machine learning (ML) models; yet its complexity poses challenges in understanding and debugging the optimization process. We present DeepCAVE, a tool for interactive visualization and analysis, providing insights into HPO. Through an interactive dashboard, researchers, data scientists, and ML engineers can explore various aspects of the HPO process and identify issues, untouched potentials, and new insights about the ML model being tuned. By empowering users with actionable insights, DeepCAVE contributes to the interpretability of HPO and ML on a design level and aims to foster the development of more robust and efficient methodologies in the future.

artificial intelligence, hutter, machine learning, (13 more...)

2512.0181

Country: Europe > Germany (0.15)

Genre: Research Report (0.50)

Technology:

Information Technology > Artificial Intelligence > Machine Learning (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Optimization (0.46)

Singh, Prabhant, Gijsbers, Pieter, Yildirim, Elif Ceren Gok, Yildirim, Murat Onur, Vanschoren, Joaquin

Automated Machine Learning for Unsupervised Tabular Tasks

arXiv.org Artificial IntelligenceOct-14-2025

In this work, we present LOTUS (Learning to Learn with Optimal Transport for Unsupervised Scenarios), a simple yet effective method to perform model selection for multiple unsupervised machine learning(ML) tasks such as outlier detection and clustering. Our intuition behind this work is that a machine learning pipeline will perform well in a new dataset if it previously worked well on datasets with a similar underlying data distribution. We use Optimal Transport distances to find this similarity between unlabeled tabular datasets and recommend machine learning pipelines with one unified single method on two downstream unsupervised tasks: outlier detection and clustering. We present the effectiveness of our approach with experiments against strong baselines and show that LOTUS is a very promising first step toward model selection for multiple unsupervised ML tasks.

artificial intelligence, data mining, machine learning, (16 more...)

2510.07569

Genre: Research Report (0.82)

Industry: Health & Medicine > Therapeutic Area > Nephrology (0.46)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Clustering (0.94)
Information Technology > Data Science > Data Mining > Anomaly Detection (0.92)

Neural Information Processing SystemsSep-29-2025, 22:53:53 GMT

Probabilistic Matrix Factorization for Automated Machine Learning

In order to achieve state-of-the-art performance, modern machine learning techniques require careful data pre-processing and hyperparameter tuning. Moreover, given the ever increasing number of machine learning models being developed, model selection is becoming increasingly important. Automating the selection and tuning of machine learning pipelines, which can include different data pre-processing methods and machine learning models, has long been one of the goals of the machine learning community. In this paper, we propose to solve this meta-learning task by combining ideas from collaborative filtering and Bayesian optimization. Specifically, we use a probabilistic matrix factorization model to transfer knowledge across experiments performed in hundreds of different datasets and use an acquisition function to guide the exploration of the space of possible ML pipelines. In our experiments, we show that our approach quickly identifies high-performing pipelines across a wide range of datasets, significantly outperforming the current state-of-the-art.

automated machine learning, name change, probabilistic matrix factorization, (3 more...)

Technology: Information Technology > Artificial Intelligence > Machine Learning (1.00)

Francia, Riccardo, Leone, Maurizio, Leonardi, Giorgio, Montani, Stefania, Pennisi, Marzio, Striani, Manuel, D'Alfonso, Sandra

AutoML-Med: A Framework for Automated Machine Learning in Medical Tabular Data

arXiv.org Artificial IntelligenceAug-5-2025

In recent years, the advent of deep learning and, in particular, transformer-based architectures, has significantly revolutionized the field of Artificial Intelligence (AI) in many scientific domains, including computer vision, natural language processing, and sequence modeling, thanks to the increasing availability of computational power and large-scale data-sets. However, classical Machine Learning (ML) methods, such as decision trees, gradient-boosted trees, Support V ector Machines (SVMs), and regression--based techniques, continue to be considered as the state-of-the-art for tabular data, which are still nowadays widely used in healthcare, finance, industrial monitoring, and other structured-data domains. There are several reasons for this. Notably, conventional AI models tend to perform reasonably well on datasets of limited size, whereas state-of-the-art deep learning techniques typically require substantially larger amounts of data to generalize effectively. Moreover, many classical AI methods, such as regression, Bayesian approaches, rule-based systems, and tree-based models, are inherently more interpretable, a characteristic that is particularly valuable in high-stakes domains such as healthcare. In contrast, deep learning models often work as black boxes, limiting their explainability. As an example, Grinsztajn et al. [1] showed that tree-based ensembles like XGBoost and Random Forests consistently outperformed a wide range of contemporary deep learning models across dozens of medium-sized tabular datasets (

artificial intelligence, deep learning, machine learning, (16 more...)

2508.02625

Country:

Europe (0.68)
North America > United States (0.46)

Genre: Research Report > Experimental Study (0.93)

Industry:

Health & Medicine > Therapeutic Area > Immunology (0.93)
Health & Medicine > Therapeutic Area > Infections and Infectious Diseases (0.93)
Health & Medicine > Consumer Health (0.68)
(2 more...)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Neural Information Processing SystemsMay-26-2025, 14:44:20 GMT

PyGlove: Symbolic Programming for Automated Machine Learning

artificial intelligence, search algorithm, search space, (9 more...)

Technology:

Information Technology > Artificial Intelligence > Machine Learning (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Search (0.91)

Moin, Armin, Wattanavaekin, Ukrit, Lungu, Alexandra, Rössler, Stephan, Günnemann, Stephan

Automated Machine Learning: A Case Study on Non-Intrusive Appliance Load Monitoring

arXiv.org Artificial IntelligenceMay-13-2025

We propose a novel approach to enable Automated Machine Learning (AutoML) for Non-Intrusive Appliance Load Monitoring (NIALM), also known as Energy Disaggregation, through Bayesian Optimization. NIALM offers a cost-effective alternative to smart meters for measuring the energy consumption of electric devices and appliances. NIALM methods analyze the entire power consumption signal of a household and predict the type of appliances as well as their individual power consumption (i.e., their contributions to the aggregated signal). We enable NIALM domain experts and practitioners who typically have no deep data analytics or Machine Learning (ML) skills to benefit from state-of-the-art ML approaches to NIALM. Further, we conduct a survey and benchmarking of the state of the art and show that in many cases, simple and basic ML models and algorithms, such as Decision Trees, outperform the state of the art. Finally, we present our open-source tool, AutoML4NIALM, which will facilitate the exploitation of existing methods for NIALM in the industry.

appliance, artificial intelligence, machine learning, (16 more...)

doi: 10.1007/978-3-031-89063-5_22

2203.02927

Country: North America > United States (0.68)

Genre:

Research Report > Promising Solution (0.49)
Overview > Innovation (0.35)

Industry: Energy > Power Industry (1.00)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Undirected Networks > Markov Models (0.48)
Information Technology > Artificial Intelligence > Machine Learning > Performance Analysis > Accuracy (0.31)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.31)

Neural Information Processing SystemsJan-21-2025, 02:45:04 GMT

Review for NeurIPS paper: PyGlove: Symbolic Programming for Automated Machine Learning

Summary and Contributions: The paper introduces an AutoML library that tries to find its own sweet spot in the large ecosystem of newly minted AutoML libraries. The paper introduces a symbolic frontend to build neural network models, with simple fundamental constructs that provide choice insertions. Unlike all other packages that I have seen and reviewed, such as Keras Tuner, NNI, AutoGluon, Optuna (btw reference missing to Optuna, you should consider adding), this paper introduces something innovative and elegant. All these other packages consistently suffer from the code of the model definition getting ugly and unweildy really quickly when you have to introduce model structure searches, and when there's interaction between structure searches and size searches. In this paper, the authors cleanly separate model structure definitions from each layer's hyperparameter choices.

automated machine learning, neurips paper, symbolic programming, (7 more...)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.60)
Information Technology > Artificial Intelligence > Systems & Languages > Programming Languages (0.40)

Neural Information Processing SystemsJan-21-2025, 02:44:57 GMT

Review for NeurIPS paper: PyGlove: Symbolic Programming for Automated Machine Learning

The reviewers generally agree that the design choices of this framework for AutoML are judicious and hit a "sweet spot". This combination of language/tooling design is of great value to expose to large swathes of the NeurIPS community. The rebuttal persuasively addresses the reviewers' concerns about the evaluation and utility of this proposal, and the response to R4 is also reassuring. We look forward to the authors' final version of the paper, incorporating the proposed improvements.

automated machine learning, neurips paper, symbolic programming, (1 more...)

Technology:

Information Technology > Artificial Intelligence > Machine Learning (0.85)
Information Technology > Artificial Intelligence > Systems & Languages > Programming Languages (0.40)

Neural Information Processing SystemsOct-9-2024, 09:25:57 GMT

PyGlove: Symbolic Programming for Automated Machine Learning

algorithm, search algorithm, search space, (8 more...)

Technology:

Information Technology > Artificial Intelligence > Machine Learning (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Search (0.91)